On support vector decision trees for database marketing
نویسندگان
چکیده
We introduce a support vector decision tree method for customer targeting in the framework of large databases (database marketing). The goal is to provide a tool to identify the best customers based on historical data. Then this tool is used to forecast the best potential customers among a pool of prospects. We begin by recursively constructing a decision tree. Each decision consists of a linear combination of independent attributes. A linear program motivated by the support vector machine method from Vapnik’s Statistical Learning Theory is used to construct each decision. This linear program automatically selects the relevant subset of attributes for each decision. Each customer is scored based on the decision tree. A gainschart table is used to verify the goodness of fit of the targeting, to determine the likely prospects and the expected utility or profit. Successful results are given for three industrial problems. The method consistently produced trees with a very small number of decision nodes. Each decision consisted of a relatively small number of attributes, which evinces the method’s power of dimensionality reduction. The largest training dataset tested contained 15,700 points with 866 attributes. The commercial optimization package used, CPLEX, is capable of solving even larger problems.
منابع مشابه
Sustainable Supplier Selection by a New Hybrid Support Vector-model based on the Cuckoo Optimization Algorithm
For assessing and selecting sustainable suppliers, this study considers a triple-bottom-line approach, including profit, people and planet, and regards business operations, environmental effects along with social responsibilities of the suppliers. Diverse metrics are acquainted with measure execution in these three issues. This study builds up a new hybrid intelligent model, namely COA-LS-SVM, ...
متن کاملMixture of experts for classification of gender, ethnic origin, and pose of human faces
In this paper we describe the application of mixtures of experts on gender and ethnic classification of human faces, and pose classification, and show their feasibility on the FERET database of facial images. The FERET database allows us to demonstrate performance on hundreds or thousands of images. The mixture of experts is implemented using the "divide and conquer" modularity principle with r...
متن کاملArtificial Intelligence Techniques for Unbalanced Datasets in Real World Classification Tasks
In this chapter a survey on the problem of classification tasks in unbalanced datasets is presented. The effect of the imbalance of the distribution of target classes in databases is analyzed with respect to the performance of standard classifiers such as decision trees and support vector machines, and the main approaches to improve the generally not satisfactory results obtained by such method...
متن کاملAn Evaluation of Support Vector Machines in Consumer Credit Analysis
This thesis examines a support vector machine approach for determining consumer credit. The support vector machine using a radial basis function (RBF) kernel is compared to a previous implementation of a decision tree machine learning model. The dataset used for evaluation was provided by a large bank and includes relevant consumer-level data, including transactions and credit-bureau data. The ...
متن کاملEvaluation of liquefaction potential based on CPT results using C4.5 decision tree
The prediction of liquefaction potential of soil due to an earthquake is an essential task in Civil Engineering. The decision tree is a tree structure consisting of internal and terminal nodes which process the data to ultimately yield a classification. C4.5 is a known algorithm widely used to design decision trees. In this algorithm, a pruning process is carried out to solve the problem of the...
متن کامل